Dataset statistics
| Number of variables | 16 |
|---|---|
| Number of observations | 3728369 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 517.2 MiB |
| Average record size in memory | 145.5 B |
Variable types
| NUM | 11 |
|---|---|
| CAT | 3 |
| DATE | 2 |
Reproduction
| Analysis started | 2020-05-07 10:37:57.744678 |
|---|---|
| Analysis finished | 2020-05-07 12:12:00.127255 |
| Version | pandas-profiling v2.6.0 |
| Command line | pandas_profiling --config_file config.yaml [YOUR_FILE.csv] |
| Download configuration | config.yaml |
STK
Real number (ℝ≥0)
| Distinct count | 552 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3525.71763 |
|---|---|
| Minimum | 3100 |
| Maximum | 9999 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 28.4 MiB |
Quantile statistics
| Minimum | 3100 |
|---|---|
| 5-th percentile | 3114 |
| Q1 | 3245 |
| median | 3524 |
| Q3 | 3739 |
| 95-th percentile | 3842 |
| Maximum | 9999 |
| Range | 6899 |
| Interquartile range (IQR) | 494 |
Descriptive statistics
| Standard deviation | 338.8045411 |
|---|---|
| Coefficient of variation (CV) | 0.0960952001 |
| Kurtosis | 58.10594476 |
| Mean | 3525.71763 |
| Median Absolute Deviation (MAD) | 229.9482183 |
| Skewness | 5.076363917 |
| Sum | 1.314517631e+10 |
| Variance | 114788.5171 |
| Value | Count | Frequency (%) | |
| 3413 | 73558 | 2.0% | |
| 3307 | 50561 | 1.4% | |
| 3851 | 35250 | 0.9% | |
| 3112 | 33364 | 0.9% | |
| 3766 | 31486 | 0.8% | |
| 3114 | 31286 | 0.8% | |
| 3243 | 27837 | 0.7% | |
| 3523 | 27576 | 0.7% | |
| 3609 | 27574 | 0.7% | |
| 3754 | 26819 | 0.7% | |
| Other values (542) | 3363058 | 90.2% |
| Value | Count | Frequency (%) | |
| 3100 | 18694 | 0.5% | |
| 3102 | 13076 | 0.4% | |
| 3103 | 2588 | 0.1% | |
| 3104 | 19890 | 0.5% | |
| 3105 | 16225 | 0.4% |
| Value | Count | Frequency (%) | |
| 9999 | 26 | < 0.1% | |
| 9998 | 1 | < 0.1% | |
| 9911 | 3 | < 0.1% | |
| 8901 | 3 | < 0.1% | |
| 8871 | 29 | < 0.1% |
DrTP
Real number (ℝ≥0)
| Distinct count | 14 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 10.036513285031605 |
|---|---|
| Minimum | 0 |
| Maximum | 13 |
| Zeros | 4318 |
| Zeros (%) | 0.1% |
| Memory size | 3.6 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 2 |
| Q1 | 5 |
| median | 13 |
| Q3 | 13 |
| 95-th percentile | 13 |
| Maximum | 13 |
| Range | 13 |
| Interquartile range (IQR) | 8 |
Descriptive statistics
| Standard deviation | 4.682111564 |
|---|---|
| Coefficient of variation (CV) | 0.4665077832 |
| Kurtosis | -0.8632876642 |
| Mean | 10.03651329 |
| Median Absolute Deviation (MAD) | 4.153595745 |
| Skewness | -1.032551861 |
| Sum | 37419825 |
| Variance | 21.9221687 |
| Value | Count | Frequency (%) | |
| 13 | 2471966 | 66.3% | |
| 2 | 820573 | 22.0% | |
| 12 | 211330 | 5.7% | |
| 5 | 186819 | 5.0% | |
| 3 | 17993 | 0.5% | |
| 7 | 5309 | 0.1% | |
| 0 | 4318 | 0.1% | |
| 6 | 3886 | 0.1% | |
| 11 | 2581 | 0.1% | |
| 9 | 1666 | < 0.1% | |
| Other values (4) | 1928 | 0.1% |
| Value | Count | Frequency (%) | |
| 0 | 4318 | 0.1% | |
| 1 | 309 | < 0.1% | |
| 2 | 820573 | 22.0% | |
| 3 | 17993 | 0.5% | |
| 4 | 145 | < 0.1% |
| Value | Count | Frequency (%) | |
| 13 | 2471966 | 66.3% | |
| 12 | 211330 | 5.7% | |
| 11 | 2581 | 0.1% | |
| 10 | 1271 | < 0.1% | |
| 9 | 1666 | < 0.1% |
| Distinct count | 3139779 |
|---|---|
| Unique (%) | 84.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 28.4 MiB |
| - | 32 |
|---|---|
| 005 | 27 |
| 001 | 26 |
| 003 | 25 |
| 008 | 24 |
| Other values (3139774) |
| Value | Count | Frequency (%) | |
| - | 32 | < 0.1% | |
| 005 | 27 | < 0.1% | |
| 001 | 26 | < 0.1% | |
| 003 | 25 | < 0.1% | |
| 008 | 24 | < 0.1% | |
| 004 | 22 | < 0.1% | |
| 033 | 22 | < 0.1% | |
| 106 | 22 | < 0.1% | |
| TEST0000000000001 | 21 | < 0.1% | |
| 124 | 21 | < 0.1% | |
| Other values (3139769) | 3728127 | > 99.9% |
Length
| Max length | 22 |
|---|---|
| Mean length | 16.466887 |
| Min length | 1 |
| Value | Count | Frequency (%) | |
| Uppercase_Letter | 26 | 54.2% | |
| Decimal_Number | 10 | 20.8% | |
| Other_Punctuation | 6 | 12.5% | |
| Close_Punctuation | 1 | 2.1% | |
| Dash_Punctuation | 1 | 2.1% | |
| Open_Punctuation | 1 | 2.1% | |
| Connector_Punctuation | 1 | 2.1% | |
| Space_Separator | 1 | 2.1% | |
| Math_Symbol | 1 | 2.1% |
| Value | Count | Frequency (%) | |
| Latin | 26 | 54.2% | |
| Common | 22 | 45.8% |
| Value | Count | Frequency (%) | |
| ASCII | 48 | 100.0% |
DatKont
Date
| Distinct count | 3718825 |
|---|---|
| Unique (%) | 99.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 28.4 MiB |
| Minimum | 2018-01-02 05:38:33.517000 |
|---|---|
| Maximum | 2018-12-31 16:07:02.557000 |
TypMot
Real number (ℝ)
| Distinct count | 62848 |
|---|---|
| Unique (%) | 1.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 28893.53555 |
|---|---|
| Minimum | -1 |
| Maximum | 62846 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Memory size | 14.2 MiB |
Quantile statistics
| Minimum | -1 |
|---|---|
| 5-th percentile | 24 |
| Q1 | 20775 |
| median | 25792 |
| Q3 | 40400 |
| 95-th percentile | 58210 |
| Maximum | 62846 |
| Range | 62847 |
| Interquartile range (IQR) | 19625 |
Descriptive statistics
| Standard deviation | 16330.73876 |
|---|---|
| Coefficient of variation (CV) | 0.5652038925 |
| Kurtosis | -0.5567911312 |
| Mean | 28893.53555 |
| Median Absolute Deviation (MAD) | 13003.13754 |
| Skewness | 0.1704521434 |
| Sum | 1.077257623e+11 |
| Variance | 266693028.5 |
| Value | Count | Frequency (%) | |
| -1 | 175087 | 4.7% | |
| 24 | 91435 | 2.5% | |
| 25756 | 36294 | 1.0% | |
| 22192 | 33405 | 0.9% | |
| 17805 | 31502 | 0.8% | |
| 26725 | 25512 | 0.7% | |
| 22621 | 25131 | 0.7% | |
| 40245 | 23539 | 0.6% | |
| 23162 | 23288 | 0.6% | |
| 21751 | 23023 | 0.6% | |
| Other values (62838) | 3240153 | 86.9% |
| Value | Count | Frequency (%) | |
| -1 | 175087 | 4.7% | |
| 0 | 1 | < 0.1% | |
| 1 | 1 | < 0.1% | |
| 2 | 1 | < 0.1% | |
| 3 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 62846 | 2 | < 0.1% | |
| 62845 | 1 | < 0.1% | |
| 62844 | 1 | < 0.1% | |
| 62843 | 1 | < 0.1% | |
| 62842 | 1 | < 0.1% |
TZn
Real number (ℝ)
| Distinct count | 6266 |
|---|---|
| Unique (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3755.8111487891892 |
|---|---|
| Minimum | -1 |
| Maximum | 6264 |
| Zeros | 4 |
| Zeros (%) | < 0.1% |
| Memory size | 7.1 MiB |
Quantile statistics
| Minimum | -1 |
|---|---|
| 5-th percentile | 652 |
| Q1 | 2238 |
| median | 4515 |
| Q3 | 4930 |
| 95-th percentile | 5846 |
| Maximum | 6264 |
| Range | 6265 |
| Interquartile range (IQR) | 2692 |
Descriptive statistics
| Standard deviation | 1697.19653 |
|---|---|
| Coefficient of variation (CV) | 0.4518854817 |
| Kurtosis | -0.9518556151 |
| Mean | 3755.811149 |
| Median Absolute Deviation (MAD) | 1476.40286 |
| Skewness | -0.5958684209 |
| Sum | 1.400304986e+10 |
| Variance | 2880476.062 |
| Value | Count | Frequency (%) | |
| 4930 | 898984 | 24.1% | |
| 1771 | 259757 | 7.0% | |
| 4515 | 192249 | 5.2% | |
| 4173 | 180757 | 4.8% | |
| 5811 | 166904 | 4.5% | |
| 5846 | 164297 | 4.4% | |
| 1033 | 130150 | 3.5% | |
| 4008 | 113780 | 3.1% | |
| 3494 | 105270 | 2.8% | |
| 2338 | 94842 | 2.5% | |
| Other values (6256) | 1421379 | 38.1% |
| Value | Count | Frequency (%) | |
| -1 | 808 | < 0.1% | |
| 0 | 4 | < 0.1% | |
| 1 | 5 | < 0.1% | |
| 2 | 1 | < 0.1% | |
| 3 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 6264 | 1 | < 0.1% | |
| 6263 | 12 | < 0.1% | |
| 6262 | 4 | < 0.1% | |
| 6261 | 3 | < 0.1% | |
| 6260 | 2 | < 0.1% |
DrVoz
Real number (ℝ)
| Distinct count | 45 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 17.84187214302018 |
|---|---|
| Minimum | -1 |
| Maximum | 43 |
| Zeros | 19450 |
| Zeros (%) | 0.5% |
| Memory size | 3.6 MiB |
Quantile statistics
| Minimum | -1 |
|---|---|
| 5-th percentile | 7 |
| Q1 | 20 |
| median | 20 |
| Q3 | 20 |
| 95-th percentile | 20 |
| Maximum | 43 |
| Range | 44 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 5.490181858 |
|---|---|
| Coefficient of variation (CV) | 0.3077133282 |
| Kurtosis | 1.806544911 |
| Mean | 17.84187214 |
| Median Absolute Deviation (MAD) | 4.117401271 |
| Skewness | -0.3956745787 |
| Sum | 66521083 |
| Variance | 30.14209684 |
| Value | Count | Frequency (%) | |
| 20 | 2705196 | 72.6% | |
| 9 | 439599 | 11.8% | |
| 7 | 200821 | 5.4% | |
| 13 | 163866 | 4.4% | |
| 11 | 46953 | 1.3% | |
| 26 | 24553 | 0.7% | |
| 29 | 21431 | 0.6% | |
| 35 | 19577 | 0.5% | |
| 0 | 19450 | 0.5% | |
| 32 | 18296 | 0.5% | |
| Other values (35) | 68627 | 1.8% |
| Value | Count | Frequency (%) | |
| -1 | 7 | < 0.1% | |
| 0 | 19450 | 0.5% | |
| 1 | 93 | < 0.1% | |
| 2 | 4808 | 0.1% | |
| 3 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 43 | 46 | < 0.1% | |
| 42 | 4 | < 0.1% | |
| 41 | 10064 | 0.3% | |
| 40 | 182 | < 0.1% | |
| 39 | 3917 | 0.1% |
ObchOznTyp
Real number (ℝ)
| Distinct count | 67142 |
|---|---|
| Unique (%) | 1.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 34759.20922 |
|---|---|
| Minimum | -1 |
| Maximum | 67140 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Memory size | 14.2 MiB |
Quantile statistics
| Minimum | -1 |
|---|---|
| 5-th percentile | 5324 |
| Q1 | 22436 |
| median | 33615 |
| Q3 | 46406 |
| 95-th percentile | 63170 |
| Maximum | 67140 |
| Range | 67141 |
| Interquartile range (IQR) | 23970 |
Descriptive statistics
| Standard deviation | 17213.43888 |
|---|---|
| Coefficient of variation (CV) | 0.4952195193 |
| Kurtosis | -0.8557945261 |
| Mean | 34759.20922 |
| Median Absolute Deviation (MAD) | 14576.68319 |
| Skewness | -0.05185456205 |
| Sum | 1.295951581e+11 |
| Variance | 296302478 |
| Value | Count | Frequency (%) | |
| 44034 | 135116 | 3.6% | |
| 27530 | 122332 | 3.3% | |
| 44043 | 85570 | 2.3% | |
| 28050 | 81507 | 2.2% | |
| 27539 | 61276 | 1.6% | |
| 27538 | 56785 | 1.5% | |
| 44041 | 49926 | 1.3% | |
| 31327 | 39592 | 1.1% | |
| 29409 | 34553 | 0.9% | |
| 44047 | 31468 | 0.8% | |
| Other values (67132) | 3030244 | 81.3% |
| Value | Count | Frequency (%) | |
| -1 | 822 | < 0.1% | |
| 0 | 1 | < 0.1% | |
| 1 | 6 | < 0.1% | |
| 2 | 8 | < 0.1% | |
| 3 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 67140 | 3 | < 0.1% | |
| 67139 | 2 | < 0.1% | |
| 67138 | 1 | < 0.1% | |
| 67137 | 1 | < 0.1% | |
| 67136 | 10 | < 0.1% |
Ct
Real number (ℝ)
| Distinct count | 135 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 43.84246865050106 |
|---|---|
| Minimum | -1 |
| Maximum | 133 |
| Zeros | 4 |
| Zeros (%) | < 0.1% |
| Memory size | 7.1 MiB |
Quantile statistics
| Minimum | -1 |
|---|---|
| 5-th percentile | 37 |
| Q1 | 42 |
| median | 42 |
| Q3 | 42 |
| 95-th percentile | 55 |
| Maximum | 133 |
| Range | 134 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 8.516253482 |
|---|---|
| Coefficient of variation (CV) | 0.1942466687 |
| Kurtosis | 22.95968419 |
| Mean | 43.84246865 |
| Median Absolute Deviation (MAD) | 4.400603515 |
| Skewness | 2.337121156 |
| Sum | 163460901 |
| Variance | 72.52657338 |
| Value | Count | Frequency (%) | |
| 42 | 2640623 | 70.8% | |
| 48 | 279634 | 7.5% | |
| 55 | 135017 | 3.6% | |
| 37 | 107751 | 2.9% | |
| 52 | 104070 | 2.8% | |
| 58 | 73324 | 2.0% | |
| 43 | 70012 | 1.9% | |
| 13 | 66544 | 1.8% | |
| 50 | 58738 | 1.6% | |
| 56 | 40783 | 1.1% | |
| Other values (125) | 151873 | 4.1% |
| Value | Count | Frequency (%) | |
| -1 | 1 | < 0.1% | |
| 0 | 4 | < 0.1% | |
| 1 | 14 | < 0.1% | |
| 2 | 1 | < 0.1% | |
| 3 | 2 | < 0.1% |
| Value | Count | Frequency (%) | |
| 133 | 165 | < 0.1% | |
| 132 | 1 | < 0.1% | |
| 131 | 1 | < 0.1% | |
| 130 | 12 | < 0.1% | |
| 129 | 3 | < 0.1% |
DatPrvReg
Date
| Distinct count | 21410 |
|---|---|
| Unique (%) | 0.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 28.4 MiB |
| Minimum | 1753-01-01 00:00:00 |
|---|---|
| Maximum | 2030-12-22 00:00:00 |
| Distinct count | 495609 |
|---|---|
| Unique (%) | 13.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 158319.9791 |
|---|---|
| Minimum | 0 |
| Maximum | 9944330 |
| Zeros | 286977 |
| Zeros (%) | 7.7% |
| Memory size | 28.4 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 60023 |
| median | 145722 |
| Q3 | 221034 |
| 95-th percentile | 360322 |
| Maximum | 9944330 |
| Range | 9944330 |
| Interquartile range (IQR) | 161011 |
Descriptive statistics
| Standard deviation | 142909.9388 |
|---|---|
| Coefficient of variation (CV) | 0.9026652205 |
| Kurtosis | 117.5483949 |
| Mean | 158319.9791 |
| Median Absolute Deviation (MAD) | 96955.17595 |
| Skewness | 4.893037318 |
| Sum | 5.90275302e+11 |
| Variance | 2.042325061e+10 |
| Value | Count | Frequency (%) | |
| 0 | 286977 | 7.7% | |
| 1 | 2032 | 0.1% | |
| 3 | 1029 | < 0.1% | |
| 7 | 841 | < 0.1% | |
| 4 | 829 | < 0.1% | |
| 8 | 794 | < 0.1% | |
| 6 | 773 | < 0.1% | |
| 10 | 769 | < 0.1% | |
| 9 | 757 | < 0.1% | |
| 11 | 717 | < 0.1% | |
| Other values (495599) | 3432851 | 92.1% |
| Value | Count | Frequency (%) | |
| 0 | 286977 | 7.7% | |
| 1 | 2032 | 0.1% | |
| 2 | 557 | < 0.1% | |
| 3 | 1029 | < 0.1% | |
| 4 | 829 | < 0.1% |
| Value | Count | Frequency (%) | |
| 9944330 | 1 | < 0.1% | |
| 9633091 | 1 | < 0.1% | |
| 9584128 | 1 | < 0.1% | |
| 9455994 | 1 | < 0.1% | |
| 9445862 | 1 | < 0.1% |
| Distinct count | 35 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.001050862 |
|---|---|
| Minimum | 0 |
| Maximum | 44 |
| Zeros | 1685149 |
| Zeros (%) | 45.2% |
| Memory size | 28.4 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 1 |
| Q3 | 3 |
| 95-th percentile | 7 |
| Maximum | 44 |
| Range | 44 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 2.571117076 |
|---|---|
| Coefficient of variation (CV) | 1.28488342 |
| Kurtosis | 2.998184476 |
| Mean | 2.001050862 |
| Median Absolute Deviation (MAD) | 2.037193931 |
| Skewness | 1.546167723 |
| Sum | 7460656 |
| Variance | 6.610643017 |
| Value | Count | Frequency (%) | |
| 0 | 1685149 | 45.2% | |
| 1 | 424795 | 11.4% | |
| 2 | 375968 | 10.1% | |
| 3 | 332738 | 8.9% | |
| 4 | 283926 | 7.6% | |
| 5 | 235610 | 6.3% | |
| 6 | 143948 | 3.9% | |
| 7 | 93675 | 2.5% | |
| 8 | 58921 | 1.6% | |
| 9 | 35272 | 0.9% | |
| Other values (25) | 58367 | 1.6% |
| Value | Count | Frequency (%) | |
| 0 | 1685149 | 45.2% | |
| 1 | 424795 | 11.4% | |
| 2 | 375968 | 10.1% | |
| 3 | 332738 | 8.9% | |
| 4 | 283926 | 7.6% |
| Value | Count | Frequency (%) | |
| 44 | 1 | < 0.1% | |
| 40 | 2 | < 0.1% | |
| 33 | 1 | < 0.1% | |
| 31 | 5 | < 0.1% | |
| 30 | 2 | < 0.1% |
| Distinct count | 32 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.1660004683 |
|---|---|
| Minimum | 0 |
| Maximum | 37 |
| Zeros | 3481154 |
| Zeros (%) | 93.4% |
| Memory size | 28.4 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 1 |
| Maximum | 37 |
| Range | 37 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.8030010426 |
|---|---|
| Coefficient of variation (CV) | 4.837342031 |
| Kurtosis | 74.10206896 |
| Mean | 0.1660004683 |
| Median Absolute Deviation (MAD) | 0.3099871253 |
| Skewness | 7.258377676 |
| Sum | 618911 |
| Variance | 0.6448106743 |
| Value | Count | Frequency (%) | |
| 0 | 3481154 | 93.4% | |
| 1 | 101735 | 2.7% | |
| 2 | 54733 | 1.5% | |
| 3 | 36185 | 1.0% | |
| 4 | 22482 | 0.6% | |
| 5 | 13134 | 0.4% | |
| 6 | 7503 | 0.2% | |
| 7 | 4466 | 0.1% | |
| 8 | 2665 | 0.1% | |
| 9 | 1652 | < 0.1% | |
| Other values (22) | 2660 | 0.1% |
| Value | Count | Frequency (%) | |
| 0 | 3481154 | 93.4% | |
| 1 | 101735 | 2.7% | |
| 2 | 54733 | 1.5% | |
| 3 | 36185 | 1.0% | |
| 4 | 22482 | 0.6% |
| Value | Count | Frequency (%) | |
| 37 | 1 | < 0.1% | |
| 32 | 1 | < 0.1% | |
| 31 | 1 | < 0.1% | |
| 28 | 1 | < 0.1% | |
| 27 | 2 | < 0.1% |
| Distinct count | 15 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.009282074816 |
|---|---|
| Minimum | 0 |
| Maximum | 23 |
| Zeros | 3705130 |
| Zeros (%) | 99.4% |
| Memory size | 28.4 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 23 |
| Range | 23 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.1350141733 |
|---|---|
| Coefficient of variation (CV) | 14.54568897 |
| Kurtosis | 831.5233199 |
| Mean | 0.009282074816 |
| Median Absolute Deviation (MAD) | 0.01844843891 |
| Skewness | 21.48757874 |
| Sum | 34607 |
| Variance | 0.01822882699 |
| Value | Count | Frequency (%) | |
| 0 | 3705130 | 99.4% | |
| 1 | 15291 | 0.4% | |
| 2 | 5600 | 0.2% | |
| 3 | 1688 | < 0.1% | |
| 4 | 440 | < 0.1% | |
| 5 | 133 | < 0.1% | |
| 6 | 51 | < 0.1% | |
| 7 | 14 | < 0.1% | |
| 8 | 11 | < 0.1% | |
| 11 | 4 | < 0.1% | |
| Other values (5) | 7 | < 0.1% |
| Value | Count | Frequency (%) | |
| 0 | 3705130 | 99.4% | |
| 1 | 15291 | 0.4% | |
| 2 | 5600 | 0.2% | |
| 3 | 1688 | < 0.1% | |
| 4 | 440 | < 0.1% |
| Value | Count | Frequency (%) | |
| 23 | 1 | < 0.1% | |
| 14 | 2 | < 0.1% | |
| 12 | 1 | < 0.1% | |
| 11 | 4 | < 0.1% | |
| 10 | 1 | < 0.1% |
VyslSTK
Categorical
| Distinct count | 4 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.6 MiB |
| 2 | |
|---|---|
| 0 | 228059 |
| 1 | 26259 |
| -1 | 48 |
| Value | Count | Frequency (%) | |
| 2 | 3474003 | 93.2% | |
| 0 | 228059 | 6.1% | |
| 1 | 26259 | 0.7% | |
| -1 | 48 | < 0.1% |
Length
| Max length | 2 |
|---|---|
| Mean length | 1.000012874 |
| Min length | 1 |
| Value | Count | Frequency (%) | |
| Decimal_Number | 3 | 75.0% | |
| Dash_Punctuation | 1 | 25.0% |
| Value | Count | Frequency (%) | |
| Common | 4 | 100.0% |
| Value | Count | Frequency (%) | |
| ASCII | 4 | 100.0% |
VyslEmise
Categorical
| Distinct count | 4 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 3.6 MiB |
| 3 | |
|---|---|
| 0 | |
| 2 | 6484 |
| 1 | 918 |
| Value | Count | Frequency (%) | |
| 3 | 2397529 | 64.3% | |
| 0 | 1323438 | 35.5% | |
| 2 | 6484 | 0.2% | |
| 1 | 918 | < 0.1% |
Length
| Max length | 1 |
|---|---|
| Mean length | 1 |
| Min length | 1 |
| Value | Count | Frequency (%) | |
| Decimal_Number | 4 | 100.0% |
| Value | Count | Frequency (%) | |
| Common | 4 | 100.0% |
| Value | Count | Frequency (%) | |
| ASCII | 4 | 100.0% |
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.First rows
| STK | DrTP | VIN | DatKont | TypMot | TZn | DrVoz | ObchOznTyp | Ct | DatPrvReg | Km | ZavA | ZavB | ZavC | VyslSTK | VyslEmise | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 3122 | 2 | JMZBLA2G601258504 | 2018-01-02 11:03:12.833 | 46194 | 3423 | 20 | 5324 | 42 | 2011-02-10 | 84818 | 0 | 0 | 0 | 2 | 0 |
| 1 | 3205 | 2 | 4150417 | 2018-01-02 11:06:07.617 | 35476 | 3744 | 7 | 26295 | 13 | 1989-01-01 | 38828 | 0 | 0 | 0 | 2 | 0 |
| 2 | 3114 | 2 | VF3MJAHXHGS280168 | 2018-01-02 11:15:08.083 | 21890 | 4173 | 20 | 5621 | 42 | 2017-01-09 | 39227 | 0 | 0 | 0 | 2 | 0 |
| 3 | 3618 | 2 | 4699845 | 2018-01-02 11:19:22.967 | 2656 | 4930 | 20 | 1668 | 42 | 1979-06-04 | 38951 | 0 | 0 | 0 | 2 | 0 |
| 4 | 3748 | 2 | WF0SXXGCDSAU06730 | 2018-01-02 11:30:25.420 | 41041 | 1771 | 20 | 29420 | 42 | 2010-06-29 | 254194 | 0 | 0 | 0 | 2 | 0 |
| 5 | 3846 | 2 | JTJBC11A402443427 | 2018-01-02 11:26:50.967 | 8184 | 3152 | 20 | 50282 | 42 | 2012-09-24 | 130258 | 0 | 0 | 0 | 2 | 0 |
| 6 | 3307 | 5 | W0922S235HNZ18070 | 2018-01-02 13:15:50.550 | -1 | 6103 | 26 | 16527 | 56 | 2017-07-27 | 0 | 0 | 0 | 0 | 2 | 0 |
| 7 | 3755 | 5 | TMBRD75L8A6012628 | 2018-01-02 12:11:56.770 | 26690 | 4930 | 20 | 65317 | 42 | 2009-12-28 | 218933 | 0 | 0 | 0 | 2 | 3 |
| 8 | 3124 | 2 | WV2ZZZ7HZ9H079590 | 2018-01-02 11:57:15.020 | 25572 | 5846 | 20 | 20363 | 42 | 2008-12-08 | 235865 | 0 | 0 | 0 | 2 | 0 |
| 9 | 3710 | 13 | WV2ZZZ7HZFH062377 | 2018-01-02 11:54:17.927 | 26447 | 5846 | 20 | 59044 | 42 | 2015-01-19 | 59557 | 0 | 0 | 0 | 2 | 3 |
Last rows
| STK | DrTP | VIN | DatKont | TypMot | TZn | DrVoz | ObchOznTyp | Ct | DatPrvReg | Km | ZavA | ZavB | ZavC | VyslSTK | VyslEmise | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 3728359 | 3603 | 13 | VF1BG0G0628995706 | 2018-03-14 14:58:12.507 | 38678 | 4515 | 20 | 36839 | 42 | 2003-12-22 | 279124 | 4 | 0 | 0 | 2 | 3 |
| 3728360 | 3633 | 13 | TMBDH25J783008811 | 2018-03-14 15:08:20.890 | 25792 | 4930 | 20 | 27530 | 42 | 2008-03-12 | 74961 | 1 | 0 | 0 | 2 | 3 |
| 3728361 | 3839 | 13 | TMBHG41U6Y2396796 | 2018-03-14 15:44:41.603 | 21751 | 4930 | 20 | 44065 | 42 | 2000-07-03 | 164377 | 1 | 0 | 0 | 2 | 3 |
| 3728362 | 3637 | 2 | JMB0NV240SJ010467 | 2018-03-14 15:48:07.160 | 11609 | 3583 | 20 | 45096 | 42 | 1998-05-15 | 159055 | 1 | 1 | 0 | 0 | 0 |
| 3728363 | 3114 | 13 | JN1CBAN16U0004800 | 2018-03-14 16:28:35.963 | 54549 | 3834 | 20 | 15210 | 42 | 2001-06-12 | 235115 | 2 | 0 | 0 | 2 | 3 |
| 3728364 | 3746 | 13 | VF37J9HXCAJ723769 | 2018-03-15 06:55:47.593 | 20477 | 4173 | 20 | 45238 | 42 | 2010-08-24 | 185318 | 3 | 0 | 0 | 2 | 3 |
| 3728365 | 3734 | 5 | XLRAE45CF0L255548 | 2018-03-15 08:28:13.880 | 26903 | 1197 | 9 | 14801 | 50 | 2004-03-22 | 381915 | 2 | 0 | 0 | 2 | 3 |
| 3728366 | 3128 | 13 | WDF63960313008173 | 2018-03-15 08:10:16.180 | 15456 | 3494 | 9 | 61376 | 48 | 2004-01-07 | 314543 | 6 | 0 | 0 | 2 | 3 |
| 3728367 | 3506 | 13 | WBAAT91090KS22634 | 2018-03-15 11:22:12.270 | 6776 | 652 | 20 | 6039 | 42 | 2004-10-18 | 261757 | 6 | 0 | 0 | 2 | 3 |
| 3728368 | 3105 | 12 | TMBJB16Y823352312 | 2018-03-15 08:29:56.967 | 22621 | 4930 | 20 | 27530 | 42 | 2002-01-24 | 65648 | 5 | 0 | 0 | 2 | 3 |